16 research outputs found

    Improving Search through A3C Reinforcement Learning based Conversational Agent

    We develop a reinforcement-learning-based search assistant that can guide users through a set of actions and a sequence of interactions to help them realize their intent. Our approach caters to subjective search, where the user is seeking digital assets such as images, which is fundamentally different from tasks with objective and limited search modalities. Labeled conversational data is generally not available for such search tasks, and training the agent through human interactions can be time consuming. We propose a stochastic virtual user which impersonates a real user and can be used to sample user behavior efficiently, accelerating the bootstrapping of the agent. We develop an A3C-based context-preserving architecture which enables the agent to provide contextual assistance to the user. We compare the A3C agent with Q-learning and evaluate its performance on the average rewards and state values it obtains with the virtual user in validation episodes. Our experiments show that the agent learns to achieve higher rewards and reach better states. Comment: 17 pages, 7 figures.
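
    To make the setup described above concrete, below is a minimal single-worker advantage actor-critic sketch (a simplification of A3C, which runs many such workers asynchronously) trained against a toy stochastic virtual user. The user model, state and action names, and reward values are illustrative assumptions, not the paper's actual system.

```python
# Minimal sketch (not the paper's system): a single-worker advantage actor-critic
# agent trained against a toy stochastic "virtual user". All states, actions,
# transition probabilities, and rewards below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

ACTIONS = ["show_results", "ask_refinement", "suggest_similar"]  # assumed agent actions
USER_STATES = ["browsing", "refining", "satisfied"]              # assumed user intents

# Stochastic virtual user: P(next user state | user state, agent action), assumed numbers.
USER_MODEL = {
    ("browsing", "show_results"):    [0.5, 0.3, 0.2],
    ("browsing", "ask_refinement"):  [0.2, 0.7, 0.1],
    ("browsing", "suggest_similar"): [0.4, 0.4, 0.2],
    ("refining", "show_results"):    [0.1, 0.4, 0.5],
    ("refining", "ask_refinement"):  [0.1, 0.7, 0.2],
    ("refining", "suggest_similar"): [0.2, 0.5, 0.3],
}

def features(state):
    """One-hot encoding of the user state."""
    x = np.zeros(len(USER_STATES))
    x[USER_STATES.index(state)] = 1.0
    return x

# Linear actor (softmax over actions) and linear critic share the same features.
theta = np.zeros((len(ACTIONS), len(USER_STATES)))  # policy weights
w = np.zeros(len(USER_STATES))                      # state-value weights
alpha_pi, alpha_v, gamma = 0.1, 0.1, 0.95

def policy(x):
    logits = theta @ x
    p = np.exp(logits - logits.max())
    return p / p.sum()

for episode in range(2000):
    state = "browsing"
    for t in range(10):
        x = features(state)
        probs = policy(x)
        a = rng.choice(len(ACTIONS), p=probs)
        nxt = rng.choice(USER_STATES, p=USER_MODEL[(state, ACTIONS[a])])
        reward = 1.0 if nxt == "satisfied" else -0.05  # assumed reward shaping
        done = nxt == "satisfied"
        # One-step advantage estimate: r + gamma * V(s') - V(s).
        v_next = 0.0 if done else w @ features(nxt)
        advantage = reward + gamma * v_next - w @ x
        # Critic update, then policy-gradient update weighted by the advantage.
        w += alpha_v * advantage * x
        grad_log = -probs[:, None] * x[None, :]
        grad_log[a] += x
        theta += alpha_pi * advantage * grad_log
        if done:
            break
        state = nxt
```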

    Nonstrict hierarchical reinforcement learning for interactive systems and robots

    Conversational systems and robots that use reinforcement learning for policy optimization in large domains often face the problem of limited scalability. This problem has been addressed either by using function approximation techniques that estimate the true value function of a policy or by using a hierarchical decomposition of a learning task into subtasks. We present a novel approach for dialogue policy optimization that combines the benefits of both hierarchical control and function approximation and that allows flexible transitions between dialogue subtasks, giving human users more control over the dialogue. To this end, each reinforcement learning agent in the hierarchy is extended with a subtask transition function and a dynamic state space to allow flexible switching between subdialogues. In addition, the subtask policies are represented with linear function approximation in order to generalize decision making to situations unseen in training. Our proposed approach is evaluated in an interactive conversational robot that learns to play quiz games. Experimental results, using simulation and real users, provide evidence that our proposed approach can lead to more flexible (natural) interactions than strict hierarchical control and that it is preferred by human users.
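
    As a rough illustration of nonstrict hierarchical control with linear function approximation, the sketch below gives each subtask its own linear Q-function and a subtask transition function that may hand control to a sibling subtask mid-dialogue. The subtask names, feature map, and transition rule are assumptions made for illustration, not the authors' implementation.

```python
# Minimal sketch (not the paper's system) of "nonstrict" hierarchical control:
# each subtask keeps a linear Q-function, and a subtask transition function may
# hand the dialogue to a sibling subtask instead of returning to the parent.
import numpy as np

SUBTASKS = ["collect_query", "present_results", "quiz_game"]  # assumed subdialogues
ACTIONS = ["ask", "inform", "confirm"]
N_FEATURES = 4
rng = np.random.default_rng(1)

# One linear Q-function per subtask: Q(s, a) = weights[subtask][a] . phi(s)
weights = {s: np.zeros((len(ACTIONS), N_FEATURES)) for s in SUBTASKS}
alpha, gamma, epsilon = 0.05, 0.95, 0.1

def phi(dialogue_state):
    """Assumed feature map: a fixed-length vector summarizing the dialogue state."""
    return np.asarray(dialogue_state, dtype=float)

def q_values(subtask, x):
    return weights[subtask] @ x

def select_action(subtask, x):
    """Epsilon-greedy action selection within the active subtask."""
    if rng.random() < epsilon:
        return int(rng.integers(len(ACTIONS)))
    return int(np.argmax(q_values(subtask, x)))

def subtask_transition(subtask, dialogue_state):
    """Assumed nonstrict transition rule: the user may pull the dialogue into a
    different subtask (e.g., asking for results while still being queried)."""
    user_pull = rng.random() < dialogue_state[-1]  # last feature ~ user initiative
    if user_pull and subtask == "collect_query":
        return "present_results"
    return subtask

def q_update(subtask, x, a, reward, next_subtask, x_next):
    """Linear Q-learning step that bootstraps from whichever subtask is active
    next, so value estimates flow across subtask boundaries."""
    target = reward + gamma * np.max(q_values(next_subtask, x_next))
    td_error = target - q_values(subtask, x)[a]
    weights[subtask][a] += alpha * td_error * x

# Example turn with made-up feature vectors.
x = phi([1.0, 0.0, 0.5, 0.3])
a = select_action("collect_query", x)
nxt = subtask_transition("collect_query", [1.0, 0.0, 0.5, 0.9])
q_update("collect_query", x, a, reward=0.2, next_subtask=nxt, x_next=phi([0.0, 1.0, 0.5, 0.1]))
```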

    Spatially-aware dialogue control using hierarchical reinforcement learning

    No full text
    This article addresses the problem of scalable optimization for spatially-aware dialogue systems. These kinds of systems must perceive, reason, and act about the spatial environment in which they are embedded. We formulate the problem in terms of Semi-Markov Decision Processes and propose a hierarchical reinforcement learning approach to optimize sub-behaviors rather than full behaviors. Because of the vast number of policies required to control the interaction in a dynamic environment (e.g., a dialogue system assisting a user to navigate in a building from one location to another), our learning approach is based on two stages: (a) the first stage learns low-level behavior in advance, and (b) the second stage learns high-level behavior in real time. For this purpose we extend an existing reinforcement learning algorithm to support reusable policies and therefore perform fast learning. We argue that our learning approach makes the problem feasible, and we report on a novel reinforcement learning dialogue system that performs a joint optimization of dialogue and spatial behaviors. Our experiments, using simulated and real environments, are based on a text-based dialogue system for indoor navigation. Experimental results in a realistic environment showed an overall user satisfaction of 89%, which suggests that our proposed approach is attractive for application in real interactions, as it combines fast learning with adaptive and reasonable behavior.
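
    The two-stage idea can be sketched as follows, assuming the low-level navigation behaviors are already available as pretrained options and the high level is optimized with SMDP Q-learning over them; the locations, options, and rewards are toy assumptions rather than the system described above.

```python
# Minimal sketch (not the paper's system): SMDP Q-learning at the high level over
# reusable low-level policies ("options") learned in advance. The building layout,
# options, and rewards are toy assumptions for illustration only.
import numpy as np

rng = np.random.default_rng(2)

LOCATIONS = ["lobby", "corridor", "lab", "office"]           # assumed building locations
OPTIONS = ["go_corridor", "go_lab", "go_office", "clarify"]  # reusable sub-behaviors

# Stage (a): pretend the low-level policies were learned offline. Executing an
# option returns (next location, cumulative reward, duration in primitive steps).
def execute_option(location, option):
    if option == "clarify":
        return location, -0.1, 1  # quick clarification sub-dialogue
    target = option.split("_")[1]
    steps = int(rng.integers(2, 6))  # the option takes several primitive steps
    reward = 1.0 if target == "office" else -0.02 * steps
    return target, reward, steps

# Stage (b): high-level SMDP Q-learning in "real time" over the options.
Q = np.zeros((len(LOCATIONS), len(OPTIONS)))
alpha, gamma, epsilon = 0.1, 0.95, 0.1

for episode in range(500):
    s = LOCATIONS.index("lobby")
    for _ in range(20):
        o = rng.integers(len(OPTIONS)) if rng.random() < epsilon else int(np.argmax(Q[s]))
        next_loc, r, tau = execute_option(LOCATIONS[s], OPTIONS[o])
        s_next = LOCATIONS.index(next_loc)
        # SMDP update: discount by gamma**tau because the option lasted tau steps.
        Q[s, o] += alpha * (r + gamma**tau * np.max(Q[s_next]) - Q[s, o])
        s = s_next
        if next_loc == "office":
            break
```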

    Proceedings of The 4th Workshop on Machine Learning for Interactive Systems

    No full text
    The goal of this workshop is to bring together researchers from multiple disciplines who are in one way or another affected by the gap between perception, action, and communication that typically exists for data-driven interactive systems or robots. We aim to provide a forum for interdisciplinary discussion that allows researchers to look at their work from new perspectives that go beyond their core community and potentially develop new interdisciplinary collaborations driven by machine learning. A multidisciplinary viewpoint is important for developing agents with a holistic perspective of the world. This is also vital for the design of agents that solve large-scale and complex real-world problems in a principled way. Machine learning will stand at the core of the workshop as a common interest across researchers.

    Behaviour of pressurised pipes containing dents and gouges

    SIGLE record. Available from the British Library Document Supply Centre (DSC:D061807 / BLDSC), United Kingdom.